Overview
Brought to you by YData
Dataset statistics
| Number of variables | 35 |
|---|---|
| Number of observations | 1852394 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 482.3 MiB |
| Average record size in memory | 273.0 B |
Variable types
| Numeric | 21 |
|---|---|
| Text | 8 |
| Categorical | 4 |
| DateTime | 1 |
| Boolean | 1 |
amt_month is highly overall correlated with amt_month_shopping_net_spend and 1 other fields | High correlation |
amt_month_shopping_net_spend is highly overall correlated with amt_month and 1 other fields | High correlation |
amt_year is highly overall correlated with trans_month | High correlation |
count_month_shopping_net is highly overall correlated with amt_month and 1 other fields | High correlation |
first_time_at_merchant is highly overall correlated with unix_time | High correlation |
lat is highly overall correlated with merch_lat | High correlation |
long is highly overall correlated with merch_long and 1 other fields | High correlation |
merch_lat is highly overall correlated with lat | High correlation |
merch_long is highly overall correlated with long and 1 other fields | High correlation |
times_shopped_at_merchant is highly overall correlated with times_shopped_at_merchant_year | High correlation |
times_shopped_at_merchant_year is highly overall correlated with times_shopped_at_merchant | High correlation |
trans_month is highly overall correlated with amt_year | High correlation |
unix_time is highly overall correlated with first_time_at_merchant and 1 other fields | High correlation |
year is highly overall correlated with unix_time | High correlation |
zip is highly overall correlated with long and 1 other fields | High correlation |
is_fraud is highly imbalanced (95.3%) | Imbalance |
amt is highly skewed (γ1 = 40.81280918) | Skewed |
trans_num has unique values | Unique |
dist_between_client_and_merch has unique values | Unique |
amt_month_shopping_net_spend has 276206 (14.9%) zeros | Zeros |
count_month_shopping_net has 276206 (14.9%) zeros | Zeros |
trans_day has 369418 (19.9%) zeros | Zeros |
hour has 60655 (3.3%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-21 06:48:03.206505 |
|---|---|
| Analysis finished | 2025-04-21 06:56:19.751789 |
| Duration | 8 minutes and 16.55 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
cc_num
Real number (ℝ)
| Distinct | 999 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.1738604 × 1017 |
| Minimum | 6.0416207 × 1010 |
|---|---|
| Maximum | 4.9923464 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 6.0416207 × 1010 |
|---|---|
| 5-th percentile | 6.3048488 × 1011 |
| Q1 | 1.8004295 × 1014 |
| median | 3.5214173 × 1015 |
| Q3 | 4.6422555 × 1015 |
| 95-th percentile | 4.497914 × 1018 |
| Maximum | 4.9923464 × 1018 |
| Range | 4.9923463 × 1018 |
| Interquartile range (IQR) | 4.4622125 × 1015 |
Descriptive statistics
| Standard deviation | 1.3091153 × 1018 |
|---|---|
| Coefficient of variation (CV) | 3.1364616 |
| Kurtosis | 6.1753558 |
| Mean | 4.1738604 × 1017 |
| Median Absolute Deviation (MAD) | 3.0764709 × 1015 |
| Skewness | 2.8510736 |
| Sum | 5.0088429 × 1018 |
| Variance | 1.7137828 × 1036 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.02704321 × 1013 | 4392 | 0.2% |
| 6.538441737 × 1015 | 4392 | 0.2% |
| 4.642255475 × 1015 | 4386 | 0.2% |
| 6.538891243 × 1015 | 4386 | 0.2% |
| 4.364010865 × 1015 | 4386 | 0.2% |
| 6.011438889 × 1015 | 4385 | 0.2% |
| 3.447098678 × 1014 | 4385 | 0.2% |
| 4.512828415 × 1018 | 4384 | 0.2% |
| 4.586810169 × 1015 | 4384 | 0.2% |
| 4.745996322 × 1012 | 4384 | 0.2% |
| Other values (989) | 1808530 |
| Value | Count | Frequency (%) |
| 6.041620718 × 1010 | 2196 | |
| 6.042292873 × 1010 | 2200 | |
| 6.042309813 × 1010 | 738 | < 0.1% |
| 6.042785159 × 1010 | 743 | < 0.1% |
| 6.048700208 × 1010 | 735 | < 0.1% |
| 6.04905963 × 1010 | 1465 | |
| 6.049559311 × 1010 | 742 | < 0.1% |
| 5.018029536 × 1011 | 2194 | |
| 5.018181333 × 1011 | 8 | < 0.1% |
| 5.018282048 × 1011 | 733 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.992346398 × 1018 | 2922 | |
| 4.989847571 × 1018 | 1471 | |
| 4.980323468 × 1018 | 736 | < 0.1% |
| 4.973530368 × 1018 | 1467 | |
| 4.958589672 × 1018 | 2191 | |
| 4.95682899 × 1018 | 3657 | |
| 4.911818931 × 1018 | 9 | < 0.1% |
| 4.906628656 × 1018 | 3655 | |
| 4.897067971 × 1018 | 1471 | |
| 4.890424427 × 1018 | 2189 |
merchant
Text
| Distinct | 693 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 36 |
| Mean length | 23.130553 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | fraud_Rippin, Kub and Mann |
|---|---|
| 2nd row | fraud_Heller, Gutmann and Zieme |
| 3rd row | fraud_Lind-Buckridge |
| 4th row | fraud_Kutch, Hermiston and Farrell |
| 5th row | fraud_Keeling-Crist |
| Value | Count | Frequency (%) |
| and | 677362 | 15.7% |
| llc | 139662 | 3.2% |
| inc | 131148 | 3.0% |
| sons | 104651 | 2.4% |
| ltd | 100896 | 2.3% |
| plc | 94799 | 2.2% |
| group | 72089 | 1.7% |
| fraud_kutch | 15028 | 0.3% |
| fraud_schaefer | 13367 | 0.3% |
| fraud_streich | 13235 | 0.3% |
| Other values (804) | 2956186 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4158232 | 9.7% |
| r | 3851348 | 9.0% |
| d | 3055994 | 7.1% |
| e | 2665745 | 6.2% |
| u | 2654462 | 6.2% |
| n | 2526397 | 5.9% |
| 2466029 | 5.8% | |
| f | 1996096 | 4.7% |
| _ | 1852394 | 4.3% |
| o | 1614017 | 3.8% |
| Other values (45) | 16006184 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42846898 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 4158232 | 9.7% |
| r | 3851348 | 9.0% |
| d | 3055994 | 7.1% |
| e | 2665745 | 6.2% |
| u | 2654462 | 6.2% |
| n | 2526397 | 5.9% |
| 2466029 | 5.8% | |
| f | 1996096 | 4.7% |
| _ | 1852394 | 4.3% |
| o | 1614017 | 3.8% |
| Other values (45) | 16006184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42846898 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 4158232 | 9.7% |
| r | 3851348 | 9.0% |
| d | 3055994 | 7.1% |
| e | 2665745 | 6.2% |
| u | 2654462 | 6.2% |
| n | 2526397 | 5.9% |
| 2466029 | 5.8% | |
| f | 1996096 | 4.7% |
| _ | 1852394 | 4.3% |
| o | 1614017 | 3.8% |
| Other values (45) | 16006184 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42846898 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 4158232 | 9.7% |
| r | 3851348 | 9.0% |
| d | 3055994 | 7.1% |
| e | 2665745 | 6.2% |
| u | 2654462 | 6.2% |
| n | 2526397 | 5.9% |
| 2466029 | 5.8% | |
| f | 1996096 | 4.7% |
| _ | 1852394 | 4.3% |
| o | 1614017 | 3.8% |
| Other values (45) | 16006184 |
category
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| gas_transport | |
|---|---|
| grocery_pos | |
| home | |
| shopping_pos | |
| kids_pets | |
| Other values (9) |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 10.525913 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | misc_net |
|---|---|
| 2nd row | grocery_pos |
| 3rd row | entertainment |
| 4th row | gas_transport |
| 5th row | misc_pos |
Common Values
| Value | Count | Frequency (%) |
| gas_transport | 188029 | |
| grocery_pos | 176191 | |
| home | 175460 | |
| shopping_pos | 166463 | |
| kids_pets | 161727 | |
| shopping_net | 139322 | |
| entertainment | 134118 | |
| food_dining | 130729 | 7.1% |
| personal_care | 130085 | 7.0% |
| health_fitness | 122553 | 6.6% |
| Other values (4) | 327717 |
Length
| Value | Count | Frequency (%) |
| gas_transport | 188029 | |
| grocery_pos | 176191 | |
| home | 175460 | |
| shopping_pos | 166463 | |
| kids_pets | 161727 | |
| shopping_net | 139322 | |
| entertainment | 134118 | |
| food_dining | 130729 | 7.1% |
| personal_care | 130085 | 7.0% |
| health_fitness | 122553 | 6.6% |
| Other values (4) | 327717 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 2042254 | |
| e | 1838696 | |
| o | 1758769 | |
| n | 1705118 | |
| p | 1548294 | 7.9% |
| t | 1538055 | 7.9% |
| _ | 1484860 | 7.6% |
| r | 1310440 | 6.7% |
| i | 1190524 | 6.1% |
| a | 950855 | 4.9% |
| Other values (10) | 4130274 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19498139 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 2042254 | |
| e | 1838696 | |
| o | 1758769 | |
| n | 1705118 | |
| p | 1548294 | 7.9% |
| t | 1538055 | 7.9% |
| _ | 1484860 | 7.6% |
| r | 1310440 | 6.7% |
| i | 1190524 | 6.1% |
| a | 950855 | 4.9% |
| Other values (10) | 4130274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19498139 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 2042254 | |
| e | 1838696 | |
| o | 1758769 | |
| n | 1705118 | |
| p | 1548294 | 7.9% |
| t | 1538055 | 7.9% |
| _ | 1484860 | 7.6% |
| r | 1310440 | 6.7% |
| i | 1190524 | 6.1% |
| a | 950855 | 4.9% |
| Other values (10) | 4130274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19498139 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 2042254 | |
| e | 1838696 | |
| o | 1758769 | |
| n | 1705118 | |
| p | 1548294 | 7.9% |
| t | 1538055 | 7.9% |
| _ | 1484860 | 7.6% |
| r | 1310440 | 6.7% |
| i | 1190524 | 6.1% |
| a | 950855 | 4.9% |
| Other values (10) | 4130274 |
amt
Real number (ℝ)
Skewed 
| Distinct | 60616 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.063567 |
| Minimum | 1 |
|---|---|
| Maximum | 28948.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.44 |
| Q1 | 9.64 |
| median | 47.45 |
| Q3 | 83.1 |
| 95-th percentile | 195.34 |
| Maximum | 28948.9 |
| Range | 28947.9 |
| Interquartile range (IQR) | 73.46 |
Descriptive statistics
| Standard deviation | 159.25397 |
|---|---|
| Coefficient of variation (CV) | 2.2729927 |
| Kurtosis | 4181.9073 |
| Mean | 70.063567 |
| Median Absolute Deviation (MAD) | 37.46 |
| Skewness | 40.812809 |
| Sum | 1.2978533 × 108 |
| Variance | 25361.828 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.14 | 779 | < 0.1% |
| 1.1 | 745 | < 0.1% |
| 1.04 | 744 | < 0.1% |
| 1.08 | 741 | < 0.1% |
| 1.25 | 737 | < 0.1% |
| 1.2 | 737 | < 0.1% |
| 1.02 | 736 | < 0.1% |
| 1.01 | 735 | < 0.1% |
| 1.22 | 727 | < 0.1% |
| 1.03 | 726 | < 0.1% |
| Other values (60606) | 1844987 |
| Value | Count | Frequency (%) |
| 1 | 332 | |
| 1.01 | 735 | |
| 1.02 | 736 | |
| 1.03 | 726 | |
| 1.04 | 744 | |
| 1.05 | 721 | |
| 1.06 | 671 | |
| 1.07 | 723 | |
| 1.08 | 741 | |
| 1.09 | 720 |
| Value | Count | Frequency (%) |
| 28948.9 | 1 | |
| 27390.12 | 1 | |
| 27119.77 | 1 | |
| 26544.12 | 1 | |
| 25086.94 | 1 | |
| 22768.11 | 1 | |
| 21437.71 | 1 | |
| 19364.91 | 1 | |
| 17897.24 | 1 | |
| 16837.08 | 1 |
first
Text
| Distinct | 355 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 6.0802977 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jennifer |
|---|---|
| 2nd row | Stephanie |
| 3rd row | Edward |
| 4th row | Jeremy |
| 5th row | Tyler |
| Value | Count | Frequency (%) |
| christopher | 38112 | 2.1% |
| robert | 30743 | 1.7% |
| jessica | 29236 | 1.6% |
| david | 28564 | 1.5% |
| michael | 28539 | 1.5% |
| james | 28496 | 1.5% |
| jennifer | 24181 | 1.3% |
| john | 23445 | 1.3% |
| mary | 23424 | 1.3% |
| william | 23396 | 1.3% |
| Other values (345) | 1574258 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1438618 | 12.8% |
| e | 1230164 | 10.9% |
| i | 883628 | 7.8% |
| n | 877668 | 7.8% |
| r | 867952 | 7.7% |
| l | 554750 | 4.9% |
| h | 493347 | 4.4% |
| s | 463151 | 4.1% |
| t | 444904 | 4.0% |
| o | 384330 | 3.4% |
| Other values (39) | 3624595 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11263107 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1438618 | 12.8% |
| e | 1230164 | 10.9% |
| i | 883628 | 7.8% |
| n | 877668 | 7.8% |
| r | 867952 | 7.7% |
| l | 554750 | 4.9% |
| h | 493347 | 4.4% |
| s | 463151 | 4.1% |
| t | 444904 | 4.0% |
| o | 384330 | 3.4% |
| Other values (39) | 3624595 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11263107 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1438618 | 12.8% |
| e | 1230164 | 10.9% |
| i | 883628 | 7.8% |
| n | 877668 | 7.8% |
| r | 867952 | 7.7% |
| l | 554750 | 4.9% |
| h | 493347 | 4.4% |
| s | 463151 | 4.1% |
| t | 444904 | 4.0% |
| o | 384330 | 3.4% |
| Other values (39) | 3624595 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11263107 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1438618 | 12.8% |
| e | 1230164 | 10.9% |
| i | 883628 | 7.8% |
| n | 877668 | 7.8% |
| r | 867952 | 7.7% |
| l | 554750 | 4.9% |
| h | 493347 | 4.4% |
| s | 463151 | 4.1% |
| t | 444904 | 4.0% |
| o | 384330 | 3.4% |
| Other values (39) | 3624595 |
last
Text
| Distinct | 486 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.1123751 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Banks |
|---|---|
| 2nd row | Gill |
| 3rd row | Sanchez |
| 4th row | White |
| 5th row | Garcia |
| Value | Count | Frequency (%) |
| smith | 40940 | 2.2% |
| williams | 33661 | 1.8% |
| davis | 31434 | 1.7% |
| johnson | 28590 | 1.5% |
| rodriguez | 24879 | 1.3% |
| martinez | 21246 | 1.1% |
| jones | 19825 | 1.1% |
| lewis | 18293 | 1.0% |
| miller | 16821 | 0.9% |
| gonzalez | 16809 | 0.9% |
| Other values (476) | 1599896 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1122673 | 9.9% |
| r | 941641 | 8.3% |
| a | 926704 | 8.2% |
| n | 869662 | 7.7% |
| o | 832319 | 7.4% |
| l | 698286 | 6.2% |
| s | 696904 | 6.2% |
| i | 622878 | 5.5% |
| t | 412730 | 3.6% |
| h | 327959 | 2.9% |
| Other values (38) | 3870771 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11322527 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1122673 | 9.9% |
| r | 941641 | 8.3% |
| a | 926704 | 8.2% |
| n | 869662 | 7.7% |
| o | 832319 | 7.4% |
| l | 698286 | 6.2% |
| s | 696904 | 6.2% |
| i | 622878 | 5.5% |
| t | 412730 | 3.6% |
| h | 327959 | 2.9% |
| Other values (38) | 3870771 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11322527 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1122673 | 9.9% |
| r | 941641 | 8.3% |
| a | 926704 | 8.2% |
| n | 869662 | 7.7% |
| o | 832319 | 7.4% |
| l | 698286 | 6.2% |
| s | 696904 | 6.2% |
| i | 622878 | 5.5% |
| t | 412730 | 3.6% |
| h | 327959 | 2.9% |
| Other values (38) | 3870771 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11322527 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1122673 | 9.9% |
| r | 941641 | 8.3% |
| a | 926704 | 8.2% |
| n | 869662 | 7.7% |
| o | 832319 | 7.4% |
| l | 698286 | 6.2% |
| s | 696904 | 6.2% |
| i | 622878 | 5.5% |
| t | 412730 | 3.6% |
| h | 327959 | 2.9% |
| Other values (38) | 3870771 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| F | 1014749 | |
| M | 837645 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 1014749 | |
| m | 837645 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 1014749 | |
| M | 837645 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 1014749 | |
| M | 837645 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 1014749 | |
| M | 837645 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 1014749 | |
| M | 837645 |
street
Text
| Distinct | 999 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 22.231289 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 561 Perry Cove |
|---|---|
| 2nd row | 43039 Riley Greens Suite 393 |
| 3rd row | 594 White Dale Suite 530 |
| 4th row | 9443 Cynthia Court Apt. 038 |
| 5th row | 408 Bradley Rest |
| Value | Count | Frequency (%) |
| apt | 468297 | 6.4% |
| suite | 437016 | 5.9% |
| island | 32903 | 0.4% |
| michael | 27058 | 0.4% |
| islands | 25611 | 0.3% |
| station | 25602 | 0.3% |
| common | 25585 | 0.3% |
| david | 24853 | 0.3% |
| brooks | 24143 | 0.3% |
| fields | 23400 | 0.3% |
| Other values (1959) | 6253340 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5515414 | 13.4% | |
| e | 2561201 | 6.2% |
| a | 2077034 | 5.0% |
| i | 1851621 | 4.5% |
| t | 1782137 | 4.3% |
| r | 1576757 | 3.8% |
| n | 1523518 | 3.7% |
| s | 1476954 | 3.6% |
| l | 1270600 | 3.1% |
| o | 1251043 | 3.0% |
| Other values (52) | 20294828 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 41181107 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5515414 | 13.4% | |
| e | 2561201 | 6.2% |
| a | 2077034 | 5.0% |
| i | 1851621 | 4.5% |
| t | 1782137 | 4.3% |
| r | 1576757 | 3.8% |
| n | 1523518 | 3.7% |
| s | 1476954 | 3.6% |
| l | 1270600 | 3.1% |
| o | 1251043 | 3.0% |
| Other values (52) | 20294828 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 41181107 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5515414 | 13.4% | |
| e | 2561201 | 6.2% |
| a | 2077034 | 5.0% |
| i | 1851621 | 4.5% |
| t | 1782137 | 4.3% |
| r | 1576757 | 3.8% |
| n | 1523518 | 3.7% |
| s | 1476954 | 3.6% |
| l | 1270600 | 3.1% |
| o | 1251043 | 3.0% |
| Other values (52) | 20294828 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 41181107 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5515414 | 13.4% | |
| e | 2561201 | 6.2% |
| a | 2077034 | 5.0% |
| i | 1851621 | 4.5% |
| t | 1782137 | 4.3% |
| r | 1576757 | 3.8% |
| n | 1523518 | 3.7% |
| s | 1476954 | 3.6% |
| l | 1270600 | 3.1% |
| o | 1251043 | 3.0% |
| Other values (52) | 20294828 |
city
Text
| Distinct | 906 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 8.6526209 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moravian Falls |
|---|---|
| 2nd row | Orient |
| 3rd row | Malad City |
| 4th row | Boulder |
| 5th row | Doe Hill |
| Value | Count | Frequency (%) |
| city | 30780 | 1.3% |
| west | 27847 | 1.2% |
| saint | 20483 | 0.9% |
| north | 20472 | 0.9% |
| falls | 18286 | 0.8% |
| new | 16857 | 0.7% |
| mount | 16098 | 0.7% |
| lake | 16089 | 0.7% |
| san | 14638 | 0.6% |
| springs | 12414 | 0.5% |
| Other values (929) | 2118136 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1555978 | 9.7% |
| a | 1334959 | 8.3% |
| n | 1173952 | 7.3% |
| o | 1168590 | 7.3% |
| l | 1115539 | 7.0% |
| r | 1070587 | 6.7% |
| i | 1007053 | 6.3% |
| t | 855511 | 5.3% |
| s | 637587 | 4.0% |
| 459706 | 2.9% | |
| Other values (42) | 5648601 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16028063 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1555978 | 9.7% |
| a | 1334959 | 8.3% |
| n | 1173952 | 7.3% |
| o | 1168590 | 7.3% |
| l | 1115539 | 7.0% |
| r | 1070587 | 6.7% |
| i | 1007053 | 6.3% |
| t | 855511 | 5.3% |
| s | 637587 | 4.0% |
| 459706 | 2.9% | |
| Other values (42) | 5648601 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16028063 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1555978 | 9.7% |
| a | 1334959 | 8.3% |
| n | 1173952 | 7.3% |
| o | 1168590 | 7.3% |
| l | 1115539 | 7.0% |
| r | 1070587 | 6.7% |
| i | 1007053 | 6.3% |
| t | 855511 | 5.3% |
| s | 637587 | 4.0% |
| 459706 | 2.9% | |
| Other values (42) | 5648601 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16028063 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1555978 | 9.7% |
| a | 1334959 | 8.3% |
| n | 1173952 | 7.3% |
| o | 1168590 | 7.3% |
| l | 1115539 | 7.0% |
| r | 1070587 | 6.7% |
| i | 1007053 | 6.3% |
| t | 855511 | 5.3% |
| s | 637587 | 4.0% |
| 459706 | 2.9% | |
| Other values (42) | 5648601 |
state
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NC |
|---|---|
| 2nd row | WA |
| 3rd row | ID |
| 4th row | MT |
| 5th row | VA |
| Value | Count | Frequency (%) |
| tx | 135269 | 7.3% |
| ny | 119419 | 6.4% |
| pa | 114173 | 6.2% |
| ca | 80495 | 4.3% |
| oh | 66627 | 3.6% |
| mi | 65825 | 3.6% |
| il | 62212 | 3.4% |
| fl | 60775 | 3.3% |
| al | 58521 | 3.2% |
| mo | 54904 | 3.0% |
| Other values (41) | 1034174 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 508580 | |
| N | 406389 | 11.0% |
| M | 314756 | 8.5% |
| I | 260547 | 7.0% |
| T | 220136 | 5.9% |
| L | 211461 | 5.7% |
| O | 205755 | 5.6% |
| C | 201235 | 5.4% |
| Y | 188176 | 5.1% |
| X | 135269 | 3.7% |
| Other values (14) | 1052484 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3704788 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 508580 | |
| N | 406389 | 11.0% |
| M | 314756 | 8.5% |
| I | 260547 | 7.0% |
| T | 220136 | 5.9% |
| L | 211461 | 5.7% |
| O | 205755 | 5.6% |
| C | 201235 | 5.4% |
| Y | 188176 | 5.1% |
| X | 135269 | 3.7% |
| Other values (14) | 1052484 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3704788 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 508580 | |
| N | 406389 | 11.0% |
| M | 314756 | 8.5% |
| I | 260547 | 7.0% |
| T | 220136 | 5.9% |
| L | 211461 | 5.7% |
| O | 205755 | 5.6% |
| C | 201235 | 5.4% |
| Y | 188176 | 5.1% |
| X | 135269 | 3.7% |
| Other values (14) | 1052484 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3704788 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 508580 | |
| N | 406389 | 11.0% |
| M | 314756 | 8.5% |
| I | 260547 | 7.0% |
| T | 220136 | 5.9% |
| L | 211461 | 5.7% |
| O | 205755 | 5.6% |
| C | 201235 | 5.4% |
| Y | 188176 | 5.1% |
| X | 135269 | 3.7% |
| Other values (14) | 1052484 |
zip
Real number (ℝ)
High correlation 
| Distinct | 985 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48813.258 |
| Minimum | 1257 |
|---|---|
| Maximum | 99921 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1257 |
|---|---|
| 5-th percentile | 7208 |
| Q1 | 26237 |
| median | 48174 |
| Q3 | 72042 |
| 95-th percentile | 94569 |
| Maximum | 99921 |
| Range | 98664 |
| Interquartile range (IQR) | 45805 |
Descriptive statistics
| Standard deviation | 26881.846 |
|---|---|
| Coefficient of variation (CV) | 0.55070788 |
| Kurtosis | -1.0960542 |
| Mean | 48813.258 |
| Median Absolute Deviation (MAD) | 23068 |
| Skewness | 0.078949647 |
| Sum | 9.0421387 × 1010 |
| Variance | 7.2263364 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 82514 | 5116 | 0.3% |
| 73754 | 5116 | 0.3% |
| 48088 | 5115 | 0.3% |
| 34112 | 5108 | 0.3% |
| 61454 | 4392 | 0.2% |
| 16114 | 4392 | 0.2% |
| 84540 | 4386 | 0.2% |
| 89512 | 4386 | 0.2% |
| 72476 | 4386 | 0.2% |
| 33872 | 4385 | 0.2% |
| Other values (975) | 1805612 |
| Value | Count | Frequency (%) |
| 1257 | 2923 | |
| 1330 | 1466 | |
| 1535 | 734 | < 0.1% |
| 1545 | 1468 | |
| 1612 | 738 | < 0.1% |
| 1843 | 3652 | |
| 1844 | 2919 | |
| 2180 | 738 | < 0.1% |
| 2630 | 2924 | |
| 2908 | 745 | < 0.1% |
| Value | Count | Frequency (%) |
| 99921 | 14 | < 0.1% |
| 99783 | 2203 | |
| 99747 | 12 | < 0.1% |
| 99746 | 734 | < 0.1% |
| 99323 | 3651 | |
| 99160 | 4362 | |
| 99116 | 15 | < 0.1% |
| 99113 | 1463 | 0.1% |
| 99033 | 3646 | |
| 98836 | 740 | < 0.1% |
lat
Real number (ℝ)
High correlation 
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.539311 |
| Minimum | 20.0271 |
|---|---|
| Maximum | 66.6933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 20.0271 |
|---|---|
| 5-th percentile | 29.8826 |
| Q1 | 34.6689 |
| median | 39.3543 |
| Q3 | 41.9404 |
| 95-th percentile | 45.8433 |
| Maximum | 66.6933 |
| Range | 46.6662 |
| Interquartile range (IQR) | 7.2715 |
Descriptive statistics
| Standard deviation | 5.0714704 |
|---|---|
| Coefficient of variation (CV) | 0.13159214 |
| Kurtosis | 0.79107707 |
| Mean | 38.539311 |
| Median Absolute Deviation (MAD) | 3.3597 |
| Skewness | -0.19199899 |
| Sum | 71389988 |
| Variance | 25.719812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43.0048 | 5116 | 0.3% |
| 36.385 | 5116 | 0.3% |
| 42.5164 | 5115 | 0.3% |
| 26.1184 | 5108 | 0.3% |
| 41.3851 | 4392 | 0.2% |
| 40.6761 | 4392 | 0.2% |
| 36.0244 | 4386 | 0.2% |
| 38.9999 | 4386 | 0.2% |
| 39.5483 | 4386 | 0.2% |
| 34.2853 | 4385 | 0.2% |
| Other values (973) | 1805612 |
| Value | Count | Frequency (%) |
| 20.0271 | 2186 | |
| 20.0827 | 1463 | 0.1% |
| 24.6557 | 3655 | |
| 26.1184 | 5108 | |
| 26.3304 | 741 | < 0.1% |
| 26.3771 | 732 | < 0.1% |
| 26.4215 | 4362 | |
| 26.4722 | 3650 | |
| 26.529 | 2202 | |
| 26.6939 | 1467 | 0.1% |
| Value | Count | Frequency (%) |
| 66.6933 | 12 | < 0.1% |
| 65.6899 | 734 | < 0.1% |
| 64.7556 | 2203 | |
| 55.4732 | 14 | < 0.1% |
| 48.8878 | 4362 | |
| 48.8856 | 2909 | |
| 48.8328 | 2200 | |
| 48.6669 | 1469 | 0.1% |
| 48.6031 | 4376 | |
| 48.4786 | 2916 |
long
Real number (ℝ)
High correlation 
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.227832 |
| Minimum | -165.6723 |
|---|---|
| Maximum | -67.9503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1852394 |
| Negative (%) | 100.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | -165.6723 |
|---|---|
| 5-th percentile | -119.0825 |
| Q1 | -96.798 |
| median | -87.4769 |
| Q3 | -80.158 |
| 95-th percentile | -73.5365 |
| Maximum | -67.9503 |
| Range | 97.722 |
| Interquartile range (IQR) | 16.64 |
Descriptive statistics
| Standard deviation | 13.747895 |
|---|---|
| Coefficient of variation (CV) | -0.15236867 |
| Kurtosis | 1.8375586 |
| Mean | -90.227832 |
| Median Absolute Deviation (MAD) | 8.1527 |
| Skewness | -1.1469188 |
| Sum | -1.671375 × 108 |
| Variance | 189.00461 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -108.8964 | 5116 | 0.3% |
| -98.0727 | 5116 | 0.3% |
| -82.9832 | 5115 | 0.3% |
| -81.7361 | 5108 | 0.3% |
| -91.0391 | 4392 | 0.2% |
| -80.1752 | 4392 | 0.2% |
| -82.7243 | 4391 | 0.2% |
| -119.7957 | 4386 | 0.2% |
| -109.615 | 4386 | 0.2% |
| -90.9288 | 4386 | 0.2% |
| Other values (973) | 1805606 |
| Value | Count | Frequency (%) |
| -165.6723 | 2203 | |
| -156.292 | 734 | < 0.1% |
| -155.488 | 1463 | |
| -155.3697 | 2186 | |
| -153.994 | 12 | < 0.1% |
| -133.1171 | 14 | < 0.1% |
| -124.4409 | 1467 | |
| -124.2174 | 2195 | |
| -124.1587 | 1465 | |
| -124.1437 | 2198 |
| Value | Count | Frequency (%) |
| -67.9503 | 2922 | |
| -68.5565 | 1467 | 0.1% |
| -69.2675 | 743 | < 0.1% |
| -69.4828 | 2931 | |
| -69.9576 | 737 | < 0.1% |
| -69.9656 | 4374 | |
| -70.1031 | 9 | < 0.1% |
| -70.239 | 1455 | 0.1% |
| -70.3001 | 2924 | |
| -70.3457 | 2196 |
city_pop
Real number (ℝ)
| Distinct | 891 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88643.675 |
| Minimum | 23 |
|---|---|
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 139 |
| Q1 | 741 |
| median | 2443 |
| Q3 | 20328 |
| 95-th percentile | 525713 |
| Maximum | 2906700 |
| Range | 2906677 |
| Interquartile range (IQR) | 19587 |
Descriptive statistics
| Standard deviation | 301487.62 |
|---|---|
| Coefficient of variation (CV) | 3.4011182 |
| Kurtosis | 37.572846 |
| Mean | 88643.675 |
| Median Absolute Deviation (MAD) | 2188 |
| Skewness | 5.5908046 |
| Sum | 1.6420301 × 1011 |
| Variance | 9.0894784 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 606 | 8049 | 0.4% |
| 1595797 | 7312 | 0.4% |
| 1312922 | 7297 | 0.4% |
| 241 | 6578 | 0.4% |
| 1766 | 6556 | 0.4% |
| 2906700 | 5865 | 0.3% |
| 302 | 5853 | 0.3% |
| 198 | 5850 | 0.3% |
| 276002 | 5849 | 0.3% |
| 1126 | 5841 | 0.3% |
| Other values (881) | 1787344 |
| Value | Count | Frequency (%) |
| 23 | 2915 | |
| 37 | 1469 | 0.1% |
| 43 | 2920 | |
| 46 | 4386 | |
| 47 | 734 | < 0.1% |
| 49 | 1472 | 0.1% |
| 51 | 1470 | 0.1% |
| 52 | 740 | < 0.1% |
| 53 | 3660 | |
| 60 | 1472 | 0.1% |
| Value | Count | Frequency (%) |
| 2906700 | 5865 | |
| 2504700 | 2929 | |
| 2383912 | 737 | < 0.1% |
| 1595797 | 7312 | |
| 1577385 | 3680 | |
| 1526206 | 5113 | |
| 1417793 | 8 | < 0.1% |
| 1382480 | 2913 | 0.2% |
| 1312922 | 7297 | |
| 1263321 | 5141 |
job
Text
| Distinct | 497 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 38 |
| Mean length | 20.232398 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Psychologist, counselling |
|---|---|
| 2nd row | Special educational needs teacher |
| 3rd row | Nature conservation officer |
| 4th row | Patent attorney |
| 5th row | Dance movement psychotherapist |
| Value | Count | Frequency (%) |
| engineer | 188048 | 4.6% |
| officer | 158202 | 3.8% |
| manager | 87837 | 2.1% |
| scientist | 79740 | 1.9% |
| designer | 74639 | 1.8% |
| surveyor | 70288 | 1.7% |
| teacher | 54865 | 1.3% |
| psychologist | 46856 | 1.1% |
| research | 42426 | 1.0% |
| editor | 40958 | 1.0% |
| Other values (457) | 3270295 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4003951 | 10.7% |
| i | 3407729 | 9.1% |
| r | 3140909 | 8.4% |
| a | 2593110 | 6.9% |
| t | 2547852 | 6.8% |
| n | 2521475 | 6.7% |
| 2261760 | 6.0% | |
| o | 2133314 | 5.7% |
| s | 2064644 | 5.5% |
| c | 1890653 | 5.0% |
| Other values (43) | 10912975 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37478372 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 4003951 | 10.7% |
| i | 3407729 | 9.1% |
| r | 3140909 | 8.4% |
| a | 2593110 | 6.9% |
| t | 2547852 | 6.8% |
| n | 2521475 | 6.7% |
| 2261760 | 6.0% | |
| o | 2133314 | 5.7% |
| s | 2064644 | 5.5% |
| c | 1890653 | 5.0% |
| Other values (43) | 10912975 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 37478372 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 4003951 | 10.7% |
| i | 3407729 | 9.1% |
| r | 3140909 | 8.4% |
| a | 2593110 | 6.9% |
| t | 2547852 | 6.8% |
| n | 2521475 | 6.7% |
| 2261760 | 6.0% | |
| o | 2133314 | 5.7% |
| s | 2064644 | 5.5% |
| c | 1890653 | 5.0% |
| Other values (43) | 10912975 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 37478372 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 4003951 | 10.7% |
| i | 3407729 | 9.1% |
| r | 3140909 | 8.4% |
| a | 2593110 | 6.9% |
| t | 2547852 | 6.8% |
| n | 2521475 | 6.7% |
| 2261760 | 6.0% | |
| o | 2133314 | 5.7% |
| s | 2064644 | 5.5% |
| c | 1890653 | 5.0% |
| Other values (43) | 10912975 |
dob
Date
| Distinct | 984 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| Minimum | 1924-10-30 00:00:00 |
|---|---|
| Maximum | 2005-01-29 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
trans_num
Text
Unique 
| Distinct | 1852394 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Unique
| Unique | 1852394 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0b242abb623afc578575680df30655b9 |
|---|---|
| 2nd row | 1f76529f8574734946361c461b024d99 |
| 3rd row | a1a22d70485983eac12b5b88dad1cf95 |
| 4th row | 6b849c168bdad6f867558c3793159a81 |
| 5th row | a41d7549acf90789359a9aa5346dcb46 |
| Value | Count | Frequency (%) |
| d71c95ab6b7356dd74389d41df429c87 | 1 | < 0.1% |
| 1765bb45b3aa3224b4cdcb6e7a96cee3 | 1 | < 0.1% |
| 0b242abb623afc578575680df30655b9 | 1 | < 0.1% |
| 1f76529f8574734946361c461b024d99 | 1 | < 0.1% |
| a1a22d70485983eac12b5b88dad1cf95 | 1 | < 0.1% |
| 6b849c168bdad6f867558c3793159a81 | 1 | < 0.1% |
| a41d7549acf90789359a9aa5346dcb46 | 1 | < 0.1% |
| 189a841a0a8ba03058526bcfe566aab5 | 1 | < 0.1% |
| 83ec1cc84142af6e2acf10c44949e720 | 1 | < 0.1% |
| 6d294ed2cc447d2c71c7171a3d54967c | 1 | < 0.1% |
| Other values (1852384) | 1852384 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 3708557 | 6.3% |
| 4 | 3707696 | 6.3% |
| 7 | 3707599 | 6.3% |
| 2 | 3707045 | 6.3% |
| 3 | 3706132 | 6.3% |
| 1 | 3705118 | 6.3% |
| d | 3704966 | 6.3% |
| a | 3704452 | 6.2% |
| 8 | 3704258 | 6.2% |
| c | 3703707 | 6.2% |
| Other values (6) | 22217078 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 59276608 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 9 | 3708557 | 6.3% |
| 4 | 3707696 | 6.3% |
| 7 | 3707599 | 6.3% |
| 2 | 3707045 | 6.3% |
| 3 | 3706132 | 6.3% |
| 1 | 3705118 | 6.3% |
| d | 3704966 | 6.3% |
| a | 3704452 | 6.2% |
| 8 | 3704258 | 6.2% |
| c | 3703707 | 6.2% |
| Other values (6) | 22217078 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 59276608 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 9 | 3708557 | 6.3% |
| 4 | 3707696 | 6.3% |
| 7 | 3707599 | 6.3% |
| 2 | 3707045 | 6.3% |
| 3 | 3706132 | 6.3% |
| 1 | 3705118 | 6.3% |
| d | 3704966 | 6.3% |
| a | 3704452 | 6.2% |
| 8 | 3704258 | 6.2% |
| c | 3703707 | 6.2% |
| Other values (6) | 22217078 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 59276608 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 9 | 3708557 | 6.3% |
| 4 | 3707696 | 6.3% |
| 7 | 3707599 | 6.3% |
| 2 | 3707045 | 6.3% |
| 3 | 3706132 | 6.3% |
| 1 | 3705118 | 6.3% |
| d | 3704966 | 6.3% |
| a | 3704452 | 6.2% |
| 8 | 3704258 | 6.2% |
| c | 3703707 | 6.2% |
| Other values (6) | 22217078 |
unix_time
Real number (ℝ)
High correlation 
| Distinct | 1819583 |
|---|---|
| Distinct (%) | 98.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3586742 × 109 |
| Minimum | 1.325376 × 109 |
|---|---|
| Maximum | 1.3885344 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1.325376 × 109 |
|---|---|
| 5-th percentile | 1.3300982 × 109 |
| Q1 | 1.3430168 × 109 |
| median | 1.3570893 × 109 |
| Q3 | 1.3745815 × 109 |
| 95-th percentile | 1.3867821 × 109 |
| Maximum | 1.3885344 × 109 |
| Range | 63158356 |
| Interquartile range (IQR) | 31564662 |
Descriptive statistics
| Standard deviation | 18195081 |
|---|---|
| Coefficient of variation (CV) | 0.013391791 |
| Kurtosis | -1.1995793 |
| Mean | 1.3586742 × 109 |
| Median Absolute Deviation (MAD) | 15789076 |
| Skewness | -0.019735681 |
| Sum | 2.5168 × 1015 |
| Variance | 3.3106099 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1335110521 | 4 | < 0.1% |
| 1370050667 | 4 | < 0.1% |
| 1370177227 | 4 | < 0.1% |
| 1381001869 | 4 | < 0.1% |
| 1386957227 | 4 | < 0.1% |
| 1387312599 | 4 | < 0.1% |
| 1387468942 | 4 | < 0.1% |
| 1344074858 | 3 | < 0.1% |
| 1355636572 | 3 | < 0.1% |
| 1336836798 | 3 | < 0.1% |
| Other values (1819573) | 1852357 |
| Value | Count | Frequency (%) |
| 1325376018 | 1 | |
| 1325376044 | 1 | |
| 1325376051 | 1 | |
| 1325376076 | 1 | |
| 1325376186 | 1 | |
| 1325376248 | 1 | |
| 1325376282 | 1 | |
| 1325376308 | 1 | |
| 1325376318 | 1 | |
| 1325376361 | 1 |
| Value | Count | Frequency (%) |
| 1388534374 | 1 | |
| 1388534364 | 1 | |
| 1388534355 | 1 | |
| 1388534349 | 1 | |
| 1388534347 | 1 | |
| 1388534314 | 1 | |
| 1388534284 | 1 | |
| 1388534276 | 1 | |
| 1388534270 | 1 | |
| 1388534238 | 1 |
merch_lat
Real number (ℝ)
High correlation 
| Distinct | 1754157 |
|---|---|
| Distinct (%) | 94.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.538976 |
| Minimum | 19.027422 |
|---|---|
| Maximum | 67.510267 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 19.027422 |
|---|---|
| 5-th percentile | 29.753795 |
| Q1 | 34.740122 |
| median | 39.3689 |
| Q3 | 41.956263 |
| 95-th percentile | 46.002013 |
| Maximum | 67.510267 |
| Range | 48.482845 |
| Interquartile range (IQR) | 7.2161407 |
Descriptive statistics
| Standard deviation | 5.1056039 |
|---|---|
| Coefficient of variation (CV) | 0.13247897 |
| Kurtosis | 0.77423362 |
| Mean | 38.538976 |
| Median Absolute Deviation (MAD) | 3.38992 |
| Skewness | -0.1880969 |
| Sum | 71389368 |
| Variance | 26.067191 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39.545984 | 4 | < 0.1% |
| 41.014694 | 4 | < 0.1% |
| 40.016559 | 4 | < 0.1% |
| 41.973278 | 4 | < 0.1% |
| 39.516582 | 4 | < 0.1% |
| 41.463521 | 4 | < 0.1% |
| 40.062499 | 4 | < 0.1% |
| 41.340895 | 4 | < 0.1% |
| 38.164527 | 4 | < 0.1% |
| 41.522948 | 4 | < 0.1% |
| Other values (1754147) | 1852354 |
| Value | Count | Frequency (%) |
| 19.027422 | 1 | |
| 19.027785 | 1 | |
| 19.027804 | 1 | |
| 19.027849 | 1 | |
| 19.029798 | 1 | |
| 19.031242 | 1 | |
| 19.032277 | 1 | |
| 19.032689 | 1 | |
| 19.033288 | 1 | |
| 19.034282 | 1 |
| Value | Count | Frequency (%) |
| 67.510267 | 1 | |
| 67.441518 | 1 | |
| 67.397018 | 1 | |
| 67.188111 | 1 | |
| 67.064277 | 1 | |
| 66.835174 | 1 | |
| 66.682905 | 1 | |
| 66.679297 | 1 | |
| 66.674714 | 1 | |
| 66.67355 | 1 |
merch_long
Real number (ℝ)
High correlation 
| Distinct | 1809753 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.22794 |
| Minimum | -166.67157 |
|---|---|
| Maximum | -66.950902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1852394 |
| Negative (%) | 100.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | -166.67157 |
|---|---|
| 5-th percentile | -119.30928 |
| Q1 | -96.89944 |
| median | -87.440694 |
| Q3 | -80.245108 |
| 95-th percentile | -73.365169 |
| Maximum | -66.950902 |
| Range | 99.720673 |
| Interquartile range (IQR) | 16.654332 |
Descriptive statistics
| Standard deviation | 13.759692 |
|---|---|
| Coefficient of variation (CV) | -0.15249924 |
| Kurtosis | 1.8312584 |
| Mean | -90.22794 |
| Median Absolute Deviation (MAD) | 8.2235005 |
| Skewness | -1.143933 |
| Sum | -1.6713769 × 108 |
| Variance | 189.32913 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.830842 | 4 | < 0.1% |
| -87.621011 | 4 | < 0.1% |
| -82.223196 | 4 | < 0.1% |
| -90.85685 | 4 | < 0.1% |
| -74.433003 | 4 | < 0.1% |
| -80.893888 | 4 | < 0.1% |
| -95.822621 | 4 | < 0.1% |
| -81.219189 | 4 | < 0.1% |
| -92.521318 | 4 | < 0.1% |
| -74.618269 | 4 | < 0.1% |
| Other values (1809743) | 1852354 |
| Value | Count | Frequency (%) |
| -166.671575 | 1 | |
| -166.671242 | 1 | |
| -166.670685 | 1 | |
| -166.670132 | 1 | |
| -166.670006 | 1 | |
| -166.66991 | 1 | |
| -166.669812 | 1 | |
| -166.669638 | 1 | |
| -166.666179 | 1 | |
| -166.664828 | 1 |
| Value | Count | Frequency (%) |
| -66.950902 | 1 | |
| -66.952026 | 1 | |
| -66.952352 | 1 | |
| -66.955602 | 1 | |
| -66.955996 | 1 | |
| -66.95654 | 1 | |
| -66.957364 | 1 | |
| -66.958659 | 1 | |
| -66.958751 | 1 | |
| -66.959178 | 1 |
is_fraud
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| 0 | |
|---|---|
| 1 | 9651 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1852394 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1842743 | |
| 1 | 9651 | 0.5% |
amt_month
Real number (ℝ)
High correlation 
| Distinct | 896534 |
|---|---|
| Distinct (%) | 48.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4153.689 |
| Minimum | 1 |
|---|---|
| Maximum | 43261.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 251.28 |
| Q1 | 1344.79 |
| median | 3071.99 |
| Q3 | 5738.47 |
| 95-th percentile | 11792.017 |
| Maximum | 43261.89 |
| Range | 43260.89 |
| Interquartile range (IQR) | 4393.68 |
Descriptive statistics
| Standard deviation | 3909.0054 |
|---|---|
| Coefficient of variation (CV) | 0.94109246 |
| Kurtosis | 6.2031988 |
| Mean | 4153.689 |
| Median Absolute Deviation (MAD) | 2005.26 |
| Skewness | 1.9707692 |
| Sum | 7.6942686 × 109 |
| Variance | 15280323 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.31 | 15 | < 0.1% |
| 1.15 | 15 | < 0.1% |
| 5.29 | 15 | < 0.1% |
| 9.12 | 14 | < 0.1% |
| 1.07 | 14 | < 0.1% |
| 8.95 | 14 | < 0.1% |
| 9.46 | 14 | < 0.1% |
| 9.37 | 13 | < 0.1% |
| 2.77 | 13 | < 0.1% |
| 4.38 | 13 | < 0.1% |
| Other values (896524) | 1852254 |
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 1.01 | 8 | |
| 1.02 | 9 | |
| 1.03 | 7 | |
| 1.04 | 6 | |
| 1.05 | 5 | < 0.1% |
| 1.06 | 6 | |
| 1.07 | 14 | |
| 1.08 | 9 | |
| 1.09 | 8 |
| Value | Count | Frequency (%) |
| 43261.89 | 1 | |
| 43055.12 | 1 | |
| 43047.94 | 1 | |
| 43013.27 | 1 | |
| 42923.81 | 1 | |
| 42917.54 | 1 | |
| 42887.02 | 1 | |
| 42841.05 | 1 | |
| 42818.8 | 1 | |
| 42750.39 | 1 |
amt_year
Real number (ℝ)
High correlation 
| Distinct | 1694572 |
|---|---|
| Distinct (%) | 91.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45305.597 |
| Minimum | 1.02 |
|---|---|
| Maximum | 219086.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1.02 |
|---|---|
| 5-th percentile | 3283.553 |
| Q1 | 17341.423 |
| median | 37439.105 |
| Q3 | 64720.88 |
| 95-th percentile | 115831.99 |
| Maximum | 219086.77 |
| Range | 219085.75 |
| Interquartile range (IQR) | 47379.458 |
Descriptive statistics
| Standard deviation | 35867.522 |
|---|---|
| Coefficient of variation (CV) | 0.79167972 |
| Kurtosis | 1.4120611 |
| Mean | 45305.597 |
| Median Absolute Deviation (MAD) | 22656.73 |
| Skewness | 1.1686746 |
| Sum | 8.3923817 × 1010 |
| Variance | 1.2864792 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5468.43 | 6 | < 0.1% |
| 8598.52 | 5 | < 0.1% |
| 14612.75 | 5 | < 0.1% |
| 1161.57 | 5 | < 0.1% |
| 26549.08 | 5 | < 0.1% |
| 12498.63 | 5 | < 0.1% |
| 9390.26 | 5 | < 0.1% |
| 31282.23 | 5 | < 0.1% |
| 19724.97 | 5 | < 0.1% |
| 13273.31 | 5 | < 0.1% |
| Other values (1694562) | 1852343 |
| Value | Count | Frequency (%) |
| 1.02 | 1 | |
| 1.03 | 1 | |
| 1.04 | 1 | |
| 1.07 | 1 | |
| 1.08 | 1 | |
| 1.13 | 2 | |
| 1.15 | 1 | |
| 1.19 | 1 | |
| 1.2 | 2 | |
| 1.21 | 1 |
| Value | Count | Frequency (%) |
| 219086.77 | 1 | |
| 219073.58 | 1 | |
| 219025.18 | 1 | |
| 218957.58 | 1 | |
| 218955.06 | 1 | |
| 218941.76 | 1 | |
| 218866.61 | 1 | |
| 218824.2 | 1 | |
| 218743.75 | 1 | |
| 218713.06 | 1 |
amt_month_shopping_net_spend
Real number (ℝ)
High correlation  Zeros 
| Distinct | 73861 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 376.2028 |
| Minimum | 0 |
|---|---|
| Maximum | 12047.18 |
| Zeros | 276206 |
| Zeros (%) | 14.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9.02 |
| median | 75.89 |
| Q3 | 425.98 |
| 95-th percentile | 1717.35 |
| Maximum | 12047.18 |
| Range | 12047.18 |
| Interquartile range (IQR) | 416.96 |
Descriptive statistics
| Standard deviation | 725.35307 |
|---|---|
| Coefficient of variation (CV) | 1.9280906 |
| Kurtosis | 23.999749 |
| Mean | 376.2028 |
| Median Absolute Deviation (MAD) | 75.89 |
| Skewness | 4.0224138 |
| Sum | 6.9687581 × 108 |
| Variance | 526137.08 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 276206 | 14.9% |
| 9.12 | 575 | < 0.1% |
| 9.89 | 528 | < 0.1% |
| 9.35 | 475 | < 0.1% |
| 9.52 | 469 | < 0.1% |
| 4.49 | 465 | < 0.1% |
| 9.2 | 451 | < 0.1% |
| 4.93 | 448 | < 0.1% |
| 7.85 | 436 | < 0.1% |
| 3.17 | 434 | < 0.1% |
| Other values (73851) | 1571907 |
| Value | Count | Frequency (%) |
| 0 | 276206 | |
| 1 | 28 | < 0.1% |
| 1.01 | 418 | < 0.1% |
| 1.02 | 278 | < 0.1% |
| 1.03 | 238 | < 0.1% |
| 1.04 | 269 | < 0.1% |
| 1.05 | 285 | < 0.1% |
| 1.06 | 178 | < 0.1% |
| 1.07 | 269 | < 0.1% |
| 1.08 | 316 | < 0.1% |
| Value | Count | Frequency (%) |
| 12047.18 | 15 | |
| 10812.12 | 3 | < 0.1% |
| 10805.83 | 5 | < 0.1% |
| 10796.17 | 28 | |
| 10790.23 | 3 | < 0.1% |
| 10339.78 | 2 | < 0.1% |
| 10245.7 | 11 | < 0.1% |
| 10242.58 | 1 | < 0.1% |
| 10238.88 | 12 | |
| 10235.86 | 12 |
count_month_shopping_net
Real number (ℝ)
High correlation  Zeros 
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5672411 |
| Minimum | 0 |
|---|---|
| Maximum | 48 |
| Zeros | 276206 |
| Zeros (%) | 14.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 48 |
| Range | 48 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.5755024 |
|---|---|
| Coefficient of variation (CV) | 1.0018088 |
| Kurtosis | 4.3978581 |
| Mean | 4.5672411 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.7306414 |
| Sum | 8460330 |
| Variance | 20.935222 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 276206 | |
| 1 | 268131 | |
| 2 | 227701 | |
| 3 | 196162 | |
| 4 | 161418 | |
| 5 | 135485 | |
| 6 | 114389 | |
| 7 | 94026 | 5.1% |
| 8 | 76806 | 4.1% |
| 9 | 62216 | 3.4% |
| Other values (39) | 239854 |
| Value | Count | Frequency (%) |
| 0 | 276206 | |
| 1 | 268131 | |
| 2 | 227701 | |
| 3 | 196162 | |
| 4 | 161418 | |
| 5 | 135485 | |
| 6 | 114389 | |
| 7 | 94026 | 5.1% |
| 8 | 76806 | 4.1% |
| 9 | 62216 | 3.4% |
| Value | Count | Frequency (%) |
| 48 | 9 | < 0.1% |
| 47 | 8 | < 0.1% |
| 46 | 6 | < 0.1% |
| 45 | 6 | < 0.1% |
| 44 | 7 | < 0.1% |
| 43 | 7 | < 0.1% |
| 42 | 20 | < 0.1% |
| 41 | 33 | |
| 40 | 53 | |
| 39 | 40 |
first_time_at_merchant
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 1323066 | |
| True | 529328 |
dist_between_client_and_merch
Real number (ℝ)
Unique 
| Distinct | 1852394 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76.109558 |
| Minimum | 0.022273513 |
|---|---|
| Maximum | 151.8682 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 0.022273513 |
|---|---|
| 5-th percentile | 24.764904 |
| Q1 | 55.341984 |
| median | 78.248227 |
| Q3 | 98.472041 |
| 95-th percentile | 120.45286 |
| Maximum | 151.8682 |
| Range | 151.84593 |
| Interquartile range (IQR) | 43.130056 |
Descriptive statistics
| Standard deviation | 29.092731 |
|---|---|
| Coefficient of variation (CV) | 0.38224806 |
| Kurtosis | -0.6320063 |
| Mean | 76.109558 |
| Median Absolute Deviation (MAD) | 21.44406 |
| Skewness | -0.23773823 |
| Sum | 1.4098489 × 108 |
| Variance | 846.387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 72.38098966 | 1 | < 0.1% |
| 78.77382075 | 1 | < 0.1% |
| 30.21661841 | 1 | < 0.1% |
| 108.1029117 | 1 | < 0.1% |
| 95.68511548 | 1 | < 0.1% |
| 77.70239516 | 1 | < 0.1% |
| 86.09735764 | 1 | < 0.1% |
| 118.0948553 | 1 | < 0.1% |
| 12.75471404 | 1 | < 0.1% |
| 25.33388259 | 1 | < 0.1% |
| Other values (1852384) | 1852384 |
| Value | Count | Frequency (%) |
| 0.02227351335 | 1 | |
| 0.06673123416 | 1 | |
| 0.09405772594 | 1 | |
| 0.1133855774 | 1 | |
| 0.1241803449 | 1 | |
| 0.1371995071 | 1 | |
| 0.1479746072 | 1 | |
| 0.1538761904 | 1 | |
| 0.2004959165 | 1 | |
| 0.2020716214 | 1 |
| Value | Count | Frequency (%) |
| 151.8682002 | 1 | |
| 150.6737431 | 1 | |
| 150.5801916 | 1 | |
| 149.6101271 | 1 | |
| 149.2055714 | 1 | |
| 148.6236717 | 1 | |
| 148.6038893 | 1 | |
| 148.5283365 | 1 | |
| 148.4270844 | 1 | |
| 148.1560878 | 1 |
trans_month
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.152067 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.4249539 |
|---|---|
| Coefficient of variation (CV) | 0.47887609 |
| Kurtosis | -1.1344614 |
| Mean | 7.152067 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.13015491 |
| Sum | 13248446 |
| Variance | 11.730309 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 280598 | |
| 8 | 176118 | |
| 6 | 173869 | |
| 7 | 172444 | |
| 5 | 146875 | |
| 3 | 143789 | |
| 11 | 143056 | |
| 9 | 140185 | |
| 10 | 138106 | |
| 4 | 134970 | |
| Other values (2) | 202384 |
| Value | Count | Frequency (%) |
| 1 | 104727 | |
| 2 | 97657 | |
| 3 | 143789 | |
| 4 | 134970 | |
| 5 | 146875 | |
| 6 | 173869 | |
| 7 | 172444 | |
| 8 | 176118 | |
| 9 | 140185 | |
| 10 | 138106 |
| Value | Count | Frequency (%) |
| 12 | 280598 | |
| 11 | 143056 | |
| 10 | 138106 | |
| 9 | 140185 | |
| 8 | 176118 | |
| 7 | 172444 | |
| 6 | 173869 | |
| 5 | 146875 | |
| 4 | 134970 | |
| 3 | 143789 |
trans_day
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9674562 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 369418 |
| Zeros (%) | 19.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.1979833 |
|---|---|
| Coefficient of variation (CV) | 0.74069612 |
| Kurtosis | -1.4581808 |
| Mean | 2.9674562 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.0077801963 |
| Sum | 5496898 |
| Variance | 4.8311304 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 369418 | |
| 6 | 343677 | |
| 1 | 270340 | |
| 5 | 263227 | |
| 4 | 215078 | |
| 3 | 206741 | |
| 2 | 183913 |
| Value | Count | Frequency (%) |
| 0 | 369418 | |
| 1 | 270340 | |
| 2 | 183913 | |
| 3 | 206741 | |
| 4 | 215078 | |
| 5 | 263227 | |
| 6 | 343677 |
| Value | Count | Frequency (%) |
| 6 | 343677 | |
| 5 | 263227 | |
| 4 | 215078 | |
| 3 | 206741 | |
| 2 | 183913 | |
| 1 | 270340 | |
| 0 | 369418 |
hour
Real number (ℝ)
Zeros 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.806119 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 60655 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.8157529 |
|---|---|
| Coefficient of variation (CV) | 0.53222627 |
| Kurtosis | -1.0781663 |
| Mean | 12.806119 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.2834188 |
| Sum | 23721978 |
| Variance | 46.454488 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 95902 | 5.2% |
| 22 | 95370 | 5.1% |
| 16 | 94289 | 5.1% |
| 18 | 94052 | 5.1% |
| 21 | 93738 | 5.1% |
| 17 | 93514 | 5.0% |
| 13 | 93492 | 5.0% |
| 15 | 93439 | 5.0% |
| 19 | 93433 | 5.0% |
| 12 | 93294 | 5.0% |
| Other values (14) | 911871 |
| Value | Count | Frequency (%) |
| 0 | 60655 | |
| 1 | 61330 | |
| 2 | 60796 | |
| 3 | 60968 | |
| 4 | 59938 | |
| 5 | 60088 | |
| 6 | 60406 | |
| 7 | 60301 | |
| 8 | 60498 | |
| 9 | 60231 |
| Value | Count | Frequency (%) |
| 23 | 95902 | |
| 22 | 95370 | |
| 21 | 93738 | |
| 20 | 93081 | |
| 19 | 93433 | |
| 18 | 94052 | |
| 17 | 93514 | |
| 16 | 94289 | |
| 15 | 93439 | |
| 14 | 93089 |
year
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| 2020 | |
|---|---|
| 2019 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
Common Values
| Value | Count | Frequency (%) |
| 2020 | 927544 | |
| 2019 | 924850 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2020 | 927544 | |
| 2019 | 924850 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2779938 | |
| 0 | 2779938 | |
| 1 | 924850 | 12.5% |
| 9 | 924850 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7409576 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2779938 | |
| 0 | 2779938 | |
| 1 | 924850 | 12.5% |
| 9 | 924850 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7409576 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2779938 | |
| 0 | 2779938 | |
| 1 | 924850 | 12.5% |
| 9 | 924850 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7409576 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2779938 | |
| 0 | 2779938 | |
| 1 | 924850 | 12.5% |
| 9 | 924850 | 12.5% |
times_shopped_at_merchant
Real number (ℝ)
High correlation 
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2980791 |
| Minimum | 1 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0943453 |
|---|---|
| Coefficient of variation (CV) | 0.58405041 |
| Kurtosis | 1.3990129 |
| Mean | 5.2980791 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.0238642 |
| Sum | 9814130 |
| Variance | 9.5749728 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 261036 | |
| 3 | 259380 | |
| 5 | 238895 | |
| 2 | 217440 | |
| 6 | 200238 | |
| 7 | 158046 | |
| 1 | 124202 | |
| 8 | 122400 | |
| 9 | 87489 | 4.7% |
| 10 | 61750 | 3.3% |
| Other values (15) | 121518 |
| Value | Count | Frequency (%) |
| 1 | 124202 | |
| 2 | 217440 | |
| 3 | 259380 | |
| 4 | 261036 | |
| 5 | 238895 | |
| 6 | 200238 | |
| 7 | 158046 | |
| 8 | 122400 | |
| 9 | 87489 | 4.7% |
| 10 | 61750 | 3.3% |
| Value | Count | Frequency (%) |
| 28 | 28 | < 0.1% |
| 24 | 72 | < 0.1% |
| 23 | 46 | < 0.1% |
| 22 | 308 | < 0.1% |
| 21 | 462 | < 0.1% |
| 20 | 800 | < 0.1% |
| 19 | 988 | 0.1% |
| 18 | 2232 | |
| 17 | 3094 | |
| 16 | 4384 |
times_shopped_at_merchant_year
Real number (ℝ)
High correlation 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1504594 |
| Minimum | 1 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8653693 |
|---|---|
| Coefficient of variation (CV) | 0.59209438 |
| Kurtosis | 1.7758299 |
| Mean | 3.1504594 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1527118 |
| Sum | 5835892 |
| Variance | 3.4796026 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 454288 | |
| 3 | 388647 | |
| 1 | 355000 | |
| 4 | 275492 | |
| 5 | 173405 | 9.4% |
| 6 | 99906 | 5.4% |
| 7 | 53319 | 2.9% |
| 8 | 27648 | 1.5% |
| 9 | 13221 | 0.7% |
| 10 | 6010 | 0.3% |
| Other values (7) | 5458 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 355000 | |
| 2 | 454288 | |
| 3 | 388647 | |
| 4 | 275492 | |
| 5 | 173405 | 9.4% |
| 6 | 99906 | 5.4% |
| 7 | 53319 | 2.9% |
| 8 | 27648 | 1.5% |
| 9 | 13221 | 0.7% |
| 10 | 6010 | 0.3% |
| Value | Count | Frequency (%) |
| 17 | 17 | < 0.1% |
| 16 | 48 | < 0.1% |
| 15 | 150 | < 0.1% |
| 14 | 238 | < 0.1% |
| 13 | 689 | < 0.1% |
| 12 | 1368 | 0.1% |
| 11 | 2948 | 0.2% |
| 10 | 6010 | 0.3% |
| 9 | 13221 | |
| 8 | 27648 |
times_shopped_at_merchant_month
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3891094 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.67225585 |
|---|---|
| Coefficient of variation (CV) | 0.48394736 |
| Kurtosis | 4.875747 |
| Mean | 1.3891094 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9826927 |
| Sum | 2573178 |
| Variance | 0.45192793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1290726 | |
| 2 | 434458 | 23.5% |
| 3 | 101286 | 5.5% |
| 4 | 21056 | 1.1% |
| 5 | 3980 | 0.2% |
| 6 | 714 | < 0.1% |
| 7 | 140 | < 0.1% |
| 9 | 18 | < 0.1% |
| 8 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1290726 | |
| 2 | 434458 | 23.5% |
| 3 | 101286 | 5.5% |
| 4 | 21056 | 1.1% |
| 5 | 3980 | 0.2% |
| 6 | 714 | < 0.1% |
| 7 | 140 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 18 | < 0.1% |
| 8 | 16 | < 0.1% |
| 7 | 140 | < 0.1% |
| 6 | 714 | < 0.1% |
| 5 | 3980 | 0.2% |
| 4 | 21056 | 1.1% |
| 3 | 101286 | 5.5% |
| 2 | 434458 | 23.5% |
| 1 | 1290726 |
times_shopped_at_merchant_day
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6554416 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 14.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.90259006 |
|---|---|
| Coefficient of variation (CV) | 0.54522617 |
| Kurtosis | 3.4584246 |
| Mean | 1.6554416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.6477251 |
| Sum | 3066530 |
| Variance | 0.81466882 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1030130 | |
| 2 | 543102 | |
| 3 | 196923 | 10.6% |
| 4 | 59592 | 3.2% |
| 5 | 16715 | 0.9% |
| 6 | 4536 | 0.2% |
| 7 | 1008 | 0.1% |
| 8 | 280 | < 0.1% |
| 9 | 108 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1030130 | |
| 2 | 543102 | |
| 3 | 196923 | 10.6% |
| 4 | 59592 | 3.2% |
| 5 | 16715 | 0.9% |
| 6 | 4536 | 0.2% |
| 7 | 1008 | 0.1% |
| 8 | 280 | < 0.1% |
| 9 | 108 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 108 | < 0.1% |
| 8 | 280 | < 0.1% |
| 7 | 1008 | 0.1% |
| 6 | 4536 | 0.2% |
| 5 | 16715 | 0.9% |
| 4 | 59592 | 3.2% |
| 3 | 196923 | 10.6% |
| 2 | 543102 | |
| 1 | 1030130 |
Interactions
Correlations
| amt | amt_month | amt_month_shopping_net_spend | amt_year | category | cc_num | city_pop | count_month_shopping_net | dist_between_client_and_merch | first_time_at_merchant | gender | hour | is_fraud | lat | long | merch_lat | merch_long | times_shopped_at_merchant | times_shopped_at_merchant_day | times_shopped_at_merchant_month | times_shopped_at_merchant_year | trans_day | trans_month | unix_time | year | zip | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| amt | 1.000 | 0.061 | 0.081 | 0.037 | 0.019 | -0.001 | -0.024 | -0.010 | -0.002 | 0.005 | 0.001 | -0.155 | 0.000 | 0.013 | -0.000 | 0.013 | -0.000 | 0.070 | 0.034 | 0.027 | 0.057 | 0.001 | -0.003 | -0.001 | 0.002 | 0.001 |
| amt_month | 0.061 | 1.000 | 0.732 | 0.472 | 0.016 | 0.008 | 0.013 | 0.821 | -0.000 | 0.167 | 0.106 | 0.067 | 0.030 | -0.012 | -0.011 | -0.012 | -0.011 | 0.257 | 0.130 | 0.148 | 0.212 | -0.016 | 0.189 | 0.129 | 0.011 | 0.017 |
| amt_month_shopping_net_spend | 0.081 | 0.732 | 1.000 | 0.348 | 0.014 | 0.010 | -0.002 | 0.777 | -0.000 | 0.065 | 0.091 | 0.048 | 0.091 | -0.005 | -0.017 | -0.004 | -0.017 | 0.175 | 0.090 | 0.099 | 0.144 | -0.012 | 0.121 | 0.084 | 0.023 | 0.025 |
| amt_year | 0.037 | 0.472 | 0.348 | 1.000 | 0.020 | 0.011 | 0.020 | 0.422 | 0.001 | 0.351 | 0.146 | 0.048 | 0.036 | -0.012 | -0.010 | -0.012 | -0.010 | 0.288 | 0.146 | 0.208 | 0.238 | -0.004 | 0.792 | 0.396 | 0.011 | 0.017 |
| category | 0.019 | 0.016 | 0.014 | 0.020 | 1.000 | 0.008 | 0.014 | 0.023 | 0.001 | 0.139 | 0.054 | 0.271 | 0.067 | 0.010 | 0.009 | 0.011 | 0.009 | 0.117 | 0.062 | 0.049 | 0.092 | 0.003 | 0.001 | 0.001 | 0.000 | 0.011 |
| cc_num | -0.001 | 0.008 | 0.010 | 0.011 | 0.008 | 1.000 | 0.049 | 0.017 | -0.000 | 0.019 | 0.052 | 0.011 | 0.003 | -0.003 | -0.013 | -0.003 | -0.013 | 0.005 | 0.004 | 0.003 | 0.005 | -0.000 | 0.001 | 0.001 | 0.000 | 0.013 |
| city_pop | -0.024 | 0.013 | -0.002 | 0.020 | 0.014 | 0.049 | 1.000 | -0.022 | 0.022 | 0.024 | 0.090 | 0.032 | 0.002 | -0.264 | 0.087 | -0.263 | 0.086 | -0.016 | -0.006 | -0.007 | -0.014 | 0.000 | -0.000 | -0.003 | 0.002 | -0.040 |
| count_month_shopping_net | -0.010 | 0.821 | 0.777 | 0.422 | 0.023 | 0.017 | -0.022 | 1.000 | -0.000 | 0.169 | 0.125 | 0.076 | 0.014 | -0.009 | -0.026 | -0.009 | -0.026 | 0.260 | 0.133 | 0.145 | 0.215 | -0.014 | 0.174 | 0.117 | 0.009 | 0.029 |
| dist_between_client_and_merch | -0.002 | -0.000 | -0.000 | 0.001 | 0.001 | -0.000 | 0.022 | -0.000 | 1.000 | 0.000 | 0.005 | 0.001 | 0.000 | -0.070 | -0.003 | -0.070 | -0.003 | 0.000 | 0.000 | 0.000 | 0.000 | -0.000 | 0.000 | -0.000 | 0.001 | 0.007 |
| first_time_at_merchant | 0.005 | 0.167 | 0.065 | 0.351 | 0.139 | 0.019 | 0.024 | 0.169 | 0.000 | 1.000 | 0.044 | 0.021 | 0.028 | 0.027 | 0.016 | 0.020 | 0.013 | 0.387 | 0.209 | 0.207 | 0.365 | 0.033 | 0.282 | 0.509 | 0.377 | 0.025 |
| gender | 0.001 | 0.106 | 0.091 | 0.146 | 0.054 | 0.052 | 0.090 | 0.125 | 0.005 | 0.044 | 1.000 | 0.045 | 0.006 | 0.101 | 0.091 | 0.103 | 0.083 | 0.131 | 0.071 | 0.055 | 0.107 | 0.007 | 0.002 | 0.000 | 0.001 | 0.116 |
| hour | -0.155 | 0.067 | 0.048 | 0.048 | 0.271 | 0.011 | 0.032 | 0.076 | 0.001 | 0.021 | 0.045 | 1.000 | 0.090 | -0.011 | -0.005 | -0.010 | -0.005 | 0.023 | 0.009 | 0.005 | 0.017 | 0.001 | -0.001 | 0.001 | 0.001 | 0.006 |
| is_fraud | 0.000 | 0.030 | 0.091 | 0.036 | 0.067 | 0.003 | 0.002 | 0.014 | 0.000 | 0.028 | 0.006 | 0.090 | 1.000 | 0.038 | 0.038 | 0.038 | 0.038 | 0.031 | 0.018 | 0.013 | 0.025 | 0.012 | 0.021 | 0.022 | 0.006 | 0.004 |
| lat | 0.013 | -0.012 | -0.005 | -0.012 | 0.010 | -0.003 | -0.264 | -0.009 | -0.070 | 0.027 | 0.101 | -0.011 | 0.038 | 1.000 | 0.105 | 0.991 | 0.104 | -0.013 | -0.005 | -0.004 | -0.009 | 0.001 | -0.000 | 0.001 | 0.003 | -0.162 |
| long | -0.000 | -0.011 | -0.017 | -0.010 | 0.009 | -0.013 | 0.087 | -0.026 | -0.003 | 0.016 | 0.091 | -0.005 | 0.038 | 0.105 | 1.000 | 0.105 | 0.998 | -0.016 | -0.008 | -0.007 | -0.014 | 0.001 | -0.001 | -0.001 | 0.003 | -0.959 |
| merch_lat | 0.013 | -0.012 | -0.004 | -0.012 | 0.011 | -0.003 | -0.263 | -0.009 | -0.070 | 0.020 | 0.103 | -0.010 | 0.038 | 0.991 | 0.105 | 1.000 | 0.104 | -0.012 | -0.005 | -0.004 | -0.009 | 0.001 | -0.000 | 0.001 | 0.003 | -0.162 |
| merch_long | -0.000 | -0.011 | -0.017 | -0.010 | 0.009 | -0.013 | 0.086 | -0.026 | -0.003 | 0.013 | 0.083 | -0.005 | 0.038 | 0.104 | 0.998 | 0.104 | 1.000 | -0.016 | -0.008 | -0.007 | -0.013 | 0.001 | -0.001 | -0.001 | 0.003 | -0.957 |
| times_shopped_at_merchant | 0.070 | 0.257 | 0.175 | 0.288 | 0.117 | 0.005 | -0.016 | 0.260 | 0.000 | 0.387 | 0.131 | 0.023 | 0.031 | -0.013 | -0.016 | -0.012 | -0.016 | 1.000 | 0.498 | 0.392 | 0.817 | 0.002 | -0.000 | -0.000 | 0.000 | 0.019 |
| times_shopped_at_merchant_day | 0.034 | 0.130 | 0.090 | 0.146 | 0.062 | 0.004 | -0.006 | 0.133 | 0.000 | 0.209 | 0.071 | 0.009 | 0.018 | -0.005 | -0.008 | -0.005 | -0.008 | 0.498 | 1.000 | 0.201 | 0.414 | -0.016 | 0.001 | 0.001 | 0.000 | 0.010 |
| times_shopped_at_merchant_month | 0.027 | 0.148 | 0.099 | 0.208 | 0.049 | 0.003 | -0.007 | 0.145 | 0.000 | 0.207 | 0.055 | 0.005 | 0.013 | -0.004 | -0.007 | -0.004 | -0.007 | 0.392 | 0.201 | 1.000 | 0.323 | -0.001 | 0.117 | 0.058 | 0.000 | 0.008 |
| times_shopped_at_merchant_year | 0.057 | 0.212 | 0.144 | 0.238 | 0.092 | 0.005 | -0.014 | 0.215 | 0.000 | 0.365 | 0.107 | 0.017 | 0.025 | -0.009 | -0.014 | -0.009 | -0.013 | 0.817 | 0.414 | 0.323 | 1.000 | 0.001 | -0.000 | 0.001 | 0.006 | 0.016 |
| trans_day | 0.001 | -0.016 | -0.012 | -0.004 | 0.003 | -0.000 | 0.000 | -0.014 | -0.000 | 0.033 | 0.007 | 0.001 | 0.012 | 0.001 | 0.001 | 0.001 | 0.001 | 0.002 | -0.016 | -0.001 | 0.001 | 1.000 | -0.005 | -0.069 | 0.125 | -0.001 |
| trans_month | -0.003 | 0.189 | 0.121 | 0.792 | 0.001 | 0.001 | -0.000 | 0.174 | 0.000 | 0.282 | 0.002 | -0.001 | 0.021 | -0.000 | -0.001 | -0.000 | -0.001 | -0.000 | 0.001 | 0.117 | -0.000 | -0.005 | 1.000 | 0.498 | 0.008 | 0.001 |
| unix_time | -0.001 | 0.129 | 0.084 | 0.396 | 0.001 | 0.001 | -0.003 | 0.117 | -0.000 | 0.509 | 0.000 | 0.001 | 0.022 | 0.001 | -0.001 | 0.001 | -0.001 | -0.000 | 0.001 | 0.058 | 0.001 | -0.069 | 0.498 | 1.000 | 0.998 | 0.001 |
| year | 0.002 | 0.011 | 0.023 | 0.011 | 0.000 | 0.000 | 0.002 | 0.009 | 0.001 | 0.377 | 0.001 | 0.001 | 0.006 | 0.003 | 0.003 | 0.003 | 0.003 | 0.000 | 0.000 | 0.000 | 0.006 | 0.125 | 0.008 | 0.998 | 1.000 | 0.002 |
| zip | 0.001 | 0.017 | 0.025 | 0.017 | 0.011 | 0.013 | -0.040 | 0.029 | 0.007 | 0.025 | 0.116 | 0.006 | 0.004 | -0.162 | -0.959 | -0.162 | -0.957 | 0.019 | 0.010 | 0.008 | 0.016 | -0.001 | 0.001 | 0.001 | 0.002 | 1.000 |
Missing values
Sample
| cc_num | merchant | category | amt | first | last | gender | street | city | state | zip | lat | long | city_pop | job | dob | trans_num | unix_time | merch_lat | merch_long | is_fraud | amt_month | amt_year | amt_month_shopping_net_spend | count_month_shopping_net | first_time_at_merchant | dist_between_client_and_merch | trans_month | trans_day | hour | year | times_shopped_at_merchant | times_shopped_at_merchant_year | times_shopped_at_merchant_month | times_shopped_at_merchant_day | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2703186189652095 | fraud_Rippin, Kub and Mann | misc_net | 4.97 | Jennifer | Banks | F | 561 Perry Cove | Moravian Falls | NC | 28654 | 36.0788 | -81.1781 | 3495 | Psychologist, counselling | 1988-03-09 | 0b242abb623afc578575680df30655b9 | 1325376018 | 36.011293 | -82.048315 | 0 | 4.97 | 4.97 | 0.0 | 0.0 | True | 78.773821 | 1 | 1 | 0 | 2019 | 5 | 4 | 2 | 1 |
| 1 | 630423337322 | fraud_Heller, Gutmann and Zieme | grocery_pos | 107.23 | Stephanie | Gill | F | 43039 Riley Greens Suite 393 | Orient | WA | 99160 | 48.8878 | -118.2105 | 149 | Special educational needs teacher | 1978-06-21 | 1f76529f8574734946361c461b024d99 | 1325376044 | 49.159047 | -118.186462 | 0 | 107.23 | 107.23 | 0.0 | 0.0 | True | 30.216618 | 1 | 1 | 0 | 2019 | 4 | 4 | 1 | 1 |
| 2 | 38859492057661 | fraud_Lind-Buckridge | entertainment | 220.11 | Edward | Sanchez | M | 594 White Dale Suite 530 | Malad City | ID | 83252 | 42.1808 | -112.2620 | 4154 | Nature conservation officer | 1962-01-19 | a1a22d70485983eac12b5b88dad1cf95 | 1325376051 | 43.150704 | -112.154481 | 0 | 220.11 | 220.11 | 0.0 | 0.0 | True | 108.102912 | 1 | 1 | 0 | 2019 | 4 | 3 | 1 | 1 |
| 3 | 3534093764340240 | fraud_Kutch, Hermiston and Farrell | gas_transport | 45.00 | Jeremy | White | M | 9443 Cynthia Court Apt. 038 | Boulder | MT | 59632 | 46.2306 | -112.1138 | 1939 | Patent attorney | 1967-01-12 | 6b849c168bdad6f867558c3793159a81 | 1325376076 | 47.034331 | -112.561071 | 0 | 45.00 | 45.00 | 0.0 | 0.0 | True | 95.685115 | 1 | 1 | 0 | 2019 | 1 | 1 | 1 | 1 |
| 4 | 375534208663984 | fraud_Keeling-Crist | misc_pos | 41.96 | Tyler | Garcia | M | 408 Bradley Rest | Doe Hill | VA | 24433 | 38.4207 | -79.4629 | 99 | Dance movement psychotherapist | 1986-03-28 | a41d7549acf90789359a9aa5346dcb46 | 1325376186 | 38.674999 | -78.632459 | 0 | 41.96 | 41.96 | 0.0 | 0.0 | True | 77.702395 | 1 | 1 | 0 | 2019 | 6 | 1 | 1 | 1 |
| 5 | 4767265376804500 | fraud_Stroman, Hudson and Erdman | gas_transport | 94.63 | Jennifer | Conner | F | 4655 David Island | Dublin | PA | 18917 | 40.3750 | -75.2045 | 2158 | Transport planner | 1961-06-19 | 189a841a0a8ba03058526bcfe566aab5 | 1325376248 | 40.653382 | -76.152667 | 0 | 94.63 | 94.63 | 0.0 | 0.0 | True | 86.097358 | 1 | 1 | 0 | 2019 | 2 | 2 | 1 | 1 |
| 6 | 30074693890476 | fraud_Rowe-Vandervort | grocery_net | 44.54 | Kelsey | Richards | F | 889 Sarah Station Suite 624 | Holcomb | KS | 67851 | 37.9931 | -100.9893 | 2691 | Arboriculturist | 1993-08-16 | 83ec1cc84142af6e2acf10c44949e720 | 1325376282 | 37.162705 | -100.153370 | 0 | 44.54 | 44.54 | 0.0 | 0.0 | True | 118.094855 | 1 | 1 | 0 | 2019 | 4 | 4 | 1 | 1 |
| 7 | 6011360759745864 | fraud_Corwin-Collins | gas_transport | 71.65 | Steven | Williams | M | 231 Flores Pass Suite 720 | Edinburg | VA | 22824 | 38.8432 | -78.6003 | 6018 | Designer, multimedia | 1947-08-21 | 6d294ed2cc447d2c71c7171a3d54967c | 1325376308 | 38.948089 | -78.540296 | 0 | 71.65 | 71.65 | 0.0 | 0.0 | True | 12.754714 | 1 | 1 | 0 | 2019 | 3 | 2 | 1 | 1 |
| 8 | 4922710831011201 | fraud_Herzog Ltd | misc_pos | 4.27 | Heather | Chase | F | 6888 Hicks Stream Suite 954 | Manor | PA | 15665 | 40.3359 | -79.6607 | 1472 | Public affairs consultant | 1941-03-07 | fc28024ce480f8ef21a32d64c93a29f5 | 1325376318 | 40.351813 | -79.958146 | 0 | 4.27 | 4.27 | 0.0 | 0.0 | True | 25.333883 | 1 | 1 | 0 | 2019 | 2 | 1 | 1 | 1 |
| 9 | 2720830304681674 | fraud_Schoen, Kuphal and Nitzsche | grocery_pos | 198.39 | Melissa | Aguilar | F | 21326 Taylor Squares Suite 708 | Clarksville | TN | 37040 | 36.5220 | -87.3490 | 151785 | Pathologist | 1974-03-28 | 3b9014ea8fb80bd65de0b1463b00b00e | 1325376361 | 37.179198 | -87.485381 | 0 | 198.39 | 198.39 | 0.0 | 0.0 | True | 73.939714 | 1 | 1 | 0 | 2019 | 5 | 4 | 1 | 2 |
| cc_num | merchant | category | amt | first | last | gender | street | city | state | zip | lat | long | city_pop | job | dob | trans_num | unix_time | merch_lat | merch_long | is_fraud | amt_month | amt_year | amt_month_shopping_net_spend | count_month_shopping_net | first_time_at_merchant | dist_between_client_and_merch | trans_month | trans_day | hour | year | times_shopped_at_merchant | times_shopped_at_merchant_year | times_shopped_at_merchant_month | times_shopped_at_merchant_day | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1852384 | 30344654314976 | fraud_Larkin, Stracke and Greenfelder | entertainment | 46.71 | Christine | Johnson | F | 8011 Chapman Tunnel Apt. 568 | Blairsden-Graeagle | CA | 96103 | 39.8127 | -120.6405 | 1725 | Chartered legal executive (England and Wales) | 1967-05-27 | a7105564935ea3977dc61ff9ced3bf5e | 1388534238 | 38.963543 | -120.457121 | 0 | 7420.57 | 41336.21 | 1706.73 | 9.0 | False | 95.590341 | 12 | 3 | 23 | 2020 | 3 | 2 | 2 | 1 |
| 1852385 | 3524574586339330 | fraud_Heathcote, Yost and Kertzmann | shopping_net | 29.56 | Ashley | Cabrera | F | 94225 Smith Springs Apt. 617 | Vero Beach | FL | 32960 | 27.6330 | -80.4031 | 105638 | Librarian, public | 1986-05-07 | 9fc9f6f9be3182d519a61a119cf97199 | 1388534270 | 27.593881 | -80.855092 | 0 | 14501.28 | 99329.66 | 2299.32 | 20.0 | False | 44.826486 | 12 | 3 | 23 | 2020 | 3 | 2 | 2 | 1 |
| 1852386 | 341546199006537 | fraud_Schmidt-Larkin | home | 12.68 | Mark | Brown | M | 8580 Moore Cove | Wales | AK | 99783 | 64.7556 | -165.6723 | 145 | Administrator, education | 1939-11-09 | a8310343c189e4a5b6316050d2d6b014 | 1388534276 | 65.623593 | -165.186033 | 0 | 8706.23 | 67211.98 | 68.22 | 5.0 | False | 99.420757 | 12 | 3 | 23 | 2020 | 4 | 4 | 1 | 1 |
| 1852387 | 501802953619 | fraud_Pouros, Walker and Spencer | kids_pets | 13.02 | Robert | Flores | M | 3277 Fields Meadows Apt. 790 | Greenview | CA | 96037 | 41.5403 | -122.9366 | 308 | Call centre manager | 1958-09-20 | bd7071fd5c9510a5594ee196368ac80e | 1388534284 | 41.973127 | -123.553032 | 0 | 9016.43 | 65502.89 | 1161.11 | 17.0 | False | 70.279450 | 12 | 3 | 23 | 2020 | 4 | 2 | 1 | 1 |
| 1852388 | 3523843138706408 | fraud_Prosacco, Kreiger and Kovacek | home | 17.00 | Grace | Williams | F | 28812 Charles Mill Apt. 628 | Plantersville | AL | 36758 | 32.6176 | -86.9475 | 1412 | Drilling engineer | 1970-11-20 | 6d04313bfe4b661b8ca2b6a499a320fe | 1388534314 | 32.164145 | -87.539669 | 0 | 13874.19 | 78212.66 | 1393.06 | 15.0 | False | 75.053155 | 12 | 3 | 23 | 2020 | 7 | 3 | 2 | 2 |
| 1852389 | 30560609640617 | fraud_Reilly and Sons | health_fitness | 43.77 | Michael | Olson | M | 558 Michael Estates | Luray | MO | 63453 | 40.4931 | -91.8912 | 519 | Town planner | 1966-02-13 | 9b1f753c79894c9f4b71f04581835ada | 1388534347 | 39.946837 | -91.333331 | 0 | 11619.63 | 72134.23 | 1014.44 | 11.0 | False | 77.032467 | 12 | 3 | 23 | 2020 | 6 | 3 | 1 | 1 |
| 1852390 | 3556613125071656 | fraud_Hoppe-Parisian | kids_pets | 111.84 | Jose | Vasquez | M | 572 Davis Mountains | Lake Jackson | TX | 77566 | 29.0393 | -95.4401 | 28739 | Futures trader | 1999-12-27 | 2090647dac2c89a1d86c514c427f5b91 | 1388534349 | 29.661049 | -96.186633 | 0 | 15224.47 | 87115.43 | 3942.78 | 25.0 | False | 100.023736 | 12 | 3 | 23 | 2020 | 5 | 3 | 1 | 1 |
| 1852391 | 6011724471098086 | fraud_Rau-Robel | kids_pets | 86.88 | Ann | Lawson | F | 144 Evans Islands Apt. 683 | Burbank | WA | 99323 | 46.1966 | -118.9017 | 3684 | Musician | 1981-11-29 | 6c5b7c8add471975aa0fec023b2e8408 | 1388534355 | 46.658340 | -119.715054 | 0 | 26233.12 | 165389.30 | 2978.91 | 29.0 | False | 80.887812 | 12 | 3 | 23 | 2020 | 10 | 7 | 1 | 2 |
| 1852392 | 4079773899158 | fraud_Breitenberg LLC | travel | 7.99 | Eric | Preston | M | 7020 Doyle Stream Apt. 951 | Mesa | ID | 83643 | 44.6255 | -116.4493 | 129 | Cartographer | 1965-12-15 | 14392d723bb7737606b2700ac791b7aa | 1388534364 | 44.470525 | -117.080888 | 0 | 11787.71 | 90698.65 | 768.69 | 17.0 | False | 53.060882 | 12 | 3 | 23 | 2020 | 4 | 2 | 2 | 1 |
| 1852393 | 4170689372027579 | fraud_Dare-Marvin | entertainment | 38.13 | Samuel | Frey | M | 830 Myers Plaza Apt. 384 | Edmond | OK | 73034 | 35.6665 | -97.4798 | 116001 | Media buyer | 1993-05-10 | 1765bb45b3aa3224b4cdcb6e7a96cee3 | 1388534374 | 36.210097 | -97.036372 | 0 | 13871.45 | 116400.29 | 883.31 | 18.0 | False | 72.380990 | 12 | 3 | 23 | 2020 | 2 | 1 | 1 | 1 |